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Abstract —Pathloss is typically modeled using a log-distance 
power law with a large-scale fading term that is log-normal. 
However, the received signal is affected by the dynamic range 
and noise floor of the measurement system used to sound the 
channel, which can cause measurement samples to be truncated 
or censored. If the information about the censored samples are 
not included in the estimation method, as in ordinary least 
squares estimation, it can result in biased estimation of both 
the pathloss exponent and the large scale fading. This can be 
solved by applying a Tobit maximum-likelihood estimator, which 
provides consistent estimates for the pathloss parameters. This 
letter provides information about the Tobit maximum-likelihood 
estimator and its asymptotic variance under certain conditions. 

Index Terms —Pathloss, maximum-likelihood estimation, or¬ 
dinary least squares, censored data, truncated data, vehicular 
communication. 


1. Introduction 

Pathloss describes the expected loss in received power as a 
function of the transmitter (Tx) and receiver (Rx) separation 
distance and the effects of random large scale fading. It 
includes losses due to the expansion of the radio wave front in 
space as well as losses due to reflection, scattering, diffraction 
and penetration. A number of pathloss models have been 
developed for a variety of wireless communication systems, 
e.g., for cellular systems, Bluetooth, Wi-Fi, vehicle-to-vehicle 
communications, and, mm-wave point-to-point communica¬ 
tions, operating over different frequency bands ranging from 
hundreds of MHz to tens of GHz iiii-E). These models 
have widely been used for the prediction and simulation of 
signal strengths for given Tx-Rx separation distances. Pathloss 
models are often developed based on channel measurements in 
realistic user scenarios. The model parameters estimated from 
measurement data are thus typically valid only for a particular 
frequency range, antenna arrangement, and environment for 
the targeted user scenario. 

However, in practice, the observation of the received signal 
power at the receiver is limited by the system noise, i.e., 
the signals with power below the noise floor can not be 
measured properly. In many vehicle-to-vehicle measurements, 
this limitation due to the system noise is often present at 
longer distances 0-11I- Also, in mm-wave measurements, the 
pathloss values are in general larger than at lower frequencies, 
which effectively can reduce the range in which the data is 
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unaffected by the noise floor. Due to the limited dynamic range 
of the measurement system, sample data might be truncated, 
whereby all data above or below a certain range are immea¬ 
surable, or censored, meaning that all data above or below 
a certain range are counted, but not measured. Estimation 
of the cluster decay and cluster fading based on truncated 
data has previously been addressed in For clusters, the 
data is modeled as truncated, since it is generally impossible 
to measure or count clusters that are below the noise floor. 
However, in pathloss measurements, where distances for the 
measurement points where the received power falls below the 
noise floor are known, it is possible to model the data as being 
censored. Estimating statistical parameters without considering 
the effects of censored or truncated data samples, can lead 
to erroneous results. The fact that this can be a problem for 
pathloss data is acknowledged in however, the authors 
do not give any detailed information on how to solve this 
issue. In this letter, we discuss the use of a Tobit model 
for censored pathloss data and a maximum-likelihood 
(ML) method for the estimation of pathloss parameters (Tg. 
Supplementary material and Matlab codes can be found in a 
supporting technical report nu¬ 
ll. Pathloss Modeling 

Pathloss is often modeled by a log-distance power law plus 
a large scale fading term (H In units of dB this can be written 
as 

PL{d) = PL{do) + lOnlogjo d> do, (1) 

where d is the distance, n is the pathloss exponent, PL (do) 
is the pathloss at a reference distance of do and is a 
random variable that describes the large-scale fading around 
the distance-dependent mean pathloss. For measurement data, 
it is here assumed that the effects of small scale fading is 
removed from the data set. It is also assumed that the peak 
value of the aggregated antenna gain is removed from the 
measurement data (H) Ideally, the variation of the aggregated 
antenna gain should be small, so that it does not affect 
the measured large-scale fading too much. The large-scale 
fading term is usually modeled by a log-normal distribution, 
which in the dB-domain corresponds to a zero-mean Gaussian 
distribution with standard deviation a, i.e., ^ A/’(0,cr^). 

Hence, the pathloss is normally distributed with a distance 
dependent expected value, PL{d) ^ J\f {iJi{d)^ cf‘^), where 

^i{d) = PL{do) + lOnlogio . (2) 
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The reference value PL (do) can be estimated based on 
measurement data, or based on reference measurements at this 
distance. For line-of-sight (LOS) scenarios, it is sometimes 
deterministically modeled based on the free-space pathloss, as 


PL{do) = 201ogio • 


(3) 


Here A is the wavelength at the given frequency. Here, it 
is worth noting that the approach of using the deterministic 
reference value of Eq. only provides theoretically correct 
results if the pathloss exponent is equal to 2. If the pathloss 
exponent is not equal to 2, but Eq. Q is used to determine the 
reference value, the data model of Eq. is inconsistent, as it 
depends on the choice of the reference distance do. Eor non- 
line-of-sight (NLOS) scenarios, it is clear that the free-space 
equation 0 does not hold, which means that the reference 
value in this case must be determined in another fashion. Due 
to the above, it is preferable to use actual measurements of the 
reference level, or, to estimate it based on the measurement 
data. In some cases, it might be difficult to produce reliable 
measurements of the reference value scenarios due to practical 
reasons, especially considering that it might be hard to produce 
a large number of uncorrelated measurement samples exactly 
at do. 


HI. Estimation by Ordinary Least Squares 

To completely model the pathloss and large-scale fading 
for a given data set, we wish to estimate the three parameters 
of 1^, i.e., n, PL{do) and The data under consideration 
is implicitly assumed to be Gaussian since is Gaussian 
in the dB domain. Using Q the data set for L path loss 
measurements, y = [PI/(d/do)]^xi can be written as. 


where 


X= ^y^l01ogio(ci;/cio), 

J=1 (9) 

Sxx = Lj (101ogio((^iM) - p ■ 

1=1 

Using Eq. (9), standard error^for n and PL{do) can be found 
by replacing the unknown standard deviation of the large scale 
fading, a, by its estimate, d, which gives 

SE(n) = .7V^, 

SE(PL(do)) = 

The standard errors are useful for evaluating the accuracy 
of the estimated parameters. However, it should be stressed 
that these standard errors only applies when the data actually 
follows the log-distance power law model with a large-scale 
fading variance that is independent of delay. Eor this reason, 
it is often necessary to validate the measurement data against 
the presumed model. This could be done by investigating 
the residuals between the measured data and the regression, 
to make sure that the residuals do not exhibit any sort of 
distance dependence. If the data seems to be described by a 
different model, then a different pathloss model would have to 
be considered. The standard error of the parameters estimated 
with OLS depend on the number of samples and the exact 
pathloss sample distances that are used in the measurement. 
However, if the data is being censored, OLS would provide 
biased results, which means that Eq. (11) no longer applies. 


y = Xa + e, (4) 

where X = [1 101ogio(d/(io)]Lx2 and a = [PL{do) n]^. 

The term e = [^cr]Lxi is a row vector describing the large- 
scale fading term for each of the L different pathloss samples. 

When there are no censored samples, the parameters of the 
log-distance power law can be estimated by applying ordinary 
least squares (OLS). The parameter a is then estimated a^ 

d= (X^X)“'xV (5) 

The variance of the large-scale fading, can then be 
estimated as 

<3^ = ^(y-Xaf(y-Xa). (6) 

The estimate d is Gaussian, 

a,- - M (aj, <72 (x^X)-/) , i = 1,2, (7) 

which alternatively can be expressed as 

PL(do) = ai ~ V (PL(do), (L-1 + , 

n = a2 ^ N [n,(j , 

^As the variance P is assumed to be independent of delay, weighted least 
squares (WLS) are not applied. However, we note that WLS could be of use 
for cases when P is being modeled with a distance dependence. 


IV. Estimation of Censored Pathloss Data 

In order to estimate the pathloss exponent and fading 
variance of censored data, with a known number of missing 
samples where only the distance is known, it is possible to 
base the estimation on a censored normal distribution. Under 
this assumption, the observations follow a censored normal 
distribution 0- The censoring occurs for data samples where 
the pathloss is greater than or equal to c. The value — c is 
a channel gain that corresponds to the noise fioor of the 
channel sounder or measurement device. In practice, c is 
chosen with some margin with respect to the noise fioor, so 
that a limited number of samples dominated by noise are 
included as measurement data. Using the data set model in 
0, the data is assumed to be censored so that observations 
with values at or above c are set to c, i.e.. 


Vi = 



if y* < c 
if y* > c 


where 


y* - A/'(xja,(7^). 


( 11 ) 


( 12 ) 


^The standard error is the standard deviation of the sampling distribution 
of a statistic. 
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The probability of observing a censored observation at a 
distance d is given by 

P( 2 / > c) = 1 - $ , (13) 

where $ is the cumulative distribution function (CDF) of the 
standard normal distribution. Now, by using / as an indicator 
function that is set to 1 if the observation is uncensored and 
is set to 0 if the observation is censored, it is possible to write 
down the likelihood function as 


N r 


l{a,a) = JJ 


i=l 


a \ (j 


V 


1-^ 


c — xia 


n l-Ii 


where 0 is the standard normal probability density function 
(PDF). The log-likelihood L{a,a.) = ln[/(cr, a)] can now be 
written as 


N 


L{(j,ol) = ^h 


i=l 

N 


—Incr -f ln0 




1-^ 


Vi - Xjg 

(j 

c — xia 
a 


(14) 


Using the log-likelihood, the parameters a and a are estimated 
using 


[(J, a] = argmin{—I/(cr, a)}, (15) 

a,cx. 


which is easily solved by numerical optimization of a and 
cr, using for instance the method of Newton pQ| . In this 
work, we have solved this by using the fminsearch function in 
Matlab, which is based on a Nelder-Mead search method. The 
estimates obtained from OLS were used as initial values for 
the minimization. The presented method approach can easily 
be further extended, so that it supports other pathloss models. 


where 

A.(X,.fl,)=(Xlr' ‘cf). < 20 ) 

with coefficients 

ai = [zicpi - </>f/(l - , 

bi = <J~^ [Zi(l)i + Zi^\l (1 - $j)] /2, (21) 

Ci = [zf(j)i + Zi4)i - zfc^ll (1 - - 2i>i] /4. 

Here, (pi = (pi{zi) and = ^i{zi) and Zi = xiat/cr. In order 
to avoid numerical issues when calculating the coefficients in 
Eq. it is worthwhile to rewrite the ratio (pi/{l — ^i) as 

Mzj) _ ;;feexp(-^f/2) _ 2 

1 — ^i{zi) 1 — |erfc(— 2 (^/\/ 2 ) \/^erfcx( 2 (^/\/ 2 ) ’ 

where erfc(-) is the complementary error function and erfcx(-) 
is the scaled complementary error function. 

As stated previously, the asymptotic variances of the pa¬ 
rameters Of are the same as for 6. Therefore, the asymptotic 
variance of the parameters PL (do), n and are given by 
the three main diagonal elements of the matrix in Eq. 

For measurement data, an estimate of the asymptotic variance 
can be found by replacing the true parameter values with their 
estimates, PL (do), n and Estimates of the standard errors 
can then be obtained simply by taking the square root of the 
asymptotic variance. 

The standard errors of the estimated parameters depend on 
many different things, such as the pathloss sample distances, 
the level of the censoring, the number of samples as well as 
the exact values of PL (do), n and cr^. Therefore, it is often 
necessary to evaluate the standard errors for each individual 
measurement case. Implemented Matlab codes for the ML 
estimator and its asymptotic variance can be found in (TT). 


A. Asymptotic Variance of the ML estimator 

The asymptotic variance of the ML estimator has been 
derived in |T0| for the problem with censoring of samples 
where < 0. We therefore transform the data in Eq. 0, by 
letting 

Yt = —y + c = —Xa — € + c = Xat — e, (16) 

where 

OLt = [-PL{do) + c - n]^. (17) 

The parameters to be estimated for the transformed data are 
6, = [ext (18) 


The asymptotic variance for the ML estimates of the original 
parameters, 6, are however the same as for the parameters of 
the transformed data. Of. Therefore, we can directly use the 
equations found in p0| to calculate the asymptotic variance 
as 


Avar(0) = Avar(0t) = diag 


N 


-1 ' 


^ ^ Af(x^, Of) 


(19) 


a=i 


V. Results 

As an example, synthetic data at 5.6 GHz was generated 
according to Eq. 0 with known parameters (n = 2 and 
cr = 4) and a synthetic censoring level at c. The parameters 
were estimated using OLS and the ML method described 
above. The result is shown in Fig. 1. The OLS method 
clearly underestimates both the pathloss exponent, n, as well 
as the standard deviation of the large scale fading, a. The 
ML method on the other hand, is able to correctly estimate 
both parameters in this example. Fig. 2 shows the same thing 
as Fig. 1, but is for measured data from a vehicle-to-vehicle 
(V2V) measurement campaign for NLOS scenarios at 5.6 GHz 
(T4). In this case, the parameter estimates obtained using OLS 
show significantly smaller values compared to the parameter 
estimates for the ML method. This large discrepancy is due 
to the large number of censored samples in this data set; 
about 45 % of the measurement data points are censored. As a 
result, the OLS, which does not consider the censored samples, 
greatly underestimates the pathloss exponent and large scale 
fading. This shows the importance of taking censored samples 
into account when estimating the pathloss parameters. 
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Fig. 1. Pathloss estimation based on censored synthetic data using the 
ML estimation method that considers censoring and using OLS without 
considering censoring. The ML method produces accurate estimates, whereas 
the OLS method underestimates n and a. 
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Fig. 2. Pathloss estimation of censored measurement data, using ML and 
OLS estimation. The estimated standard error for the ML estimates are 
SE(PL(do)) = 0.72 dB, SE(n) = 0.04 and = 1.6. 

VI. Conclusions 

In this letter, we suggest the use of a Tobit ML method 0 
for the estimation of pathloss parameters based on censored 
data. When the data is censored, the standard approach of 
OLS, which has been widely used in the literature, is in¬ 
consistent, and yields biased estimates. The suggested ML 
estimator solves this problem by jointly estimating the pa¬ 
rameters based on a censored normal distribution. Equations 


for the standard errors of this estimator are also provided. 
Using these equations, we show that the sampling distribution 
of the measurement samples can have a significant effect on 
the standard error in typical pathloss measurements. Using 
synthetic pathloss data that is censored, we also show that 
the ML method is able to correctly estimate the pathloss 
parameters, whereas OLS is biased and underestimates the 
pathloss exponent and the large-scale fading variance. Lastly, 
by using measured pathloss data from a V2V measurement 
campaign, we see that the ML method yields drastically 
different and more realistic estimates compared to the OLS 
method. Additional results and Matlab codes can be found in 
the supplementary technical report GD- 
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